A parallel computational framework for ultra-large-scale sequence clustering analysis
نویسندگان
چکیده
منابع مشابه
Parallel Hierarchical Clustering in Linearithmic Time for Ultra-Large-Scale Sequence Analysis Supplementary Data
Parallel Hierarchical Clustering in Linearithmic Time for Ultra-Large-Scale Sequence Analysis Supplementary Data Qi Mao1, Wei Zheng2, Li Wang4, Yunpeng Cai5, Volker Mai6, Yijun Sun1,2,3∗ Department of Microbiology and Immunology, Department of Computer Science and Engineering, Department of Biostatistics, The State University of New York at Buffalo, Buffalo, NY 14203, USA. The Institute for Com...
متن کاملA partition-based algorithm for clustering large-scale software systems
Clustering techniques are used to extract the structure of software for understanding, maintaining, and refactoring. In the literature, most of the proposed approaches for software clustering are divided into hierarchical algorithms and search-based techniques. In the former, clustering is a process of merging (splitting) similar (non-similar) clusters. These techniques suffered from the drawba...
متن کاملLarge-scale parallel data clustering
Algorithmic enhancements are described that enable large computational reduction in mean square-error data clustering. These improvements are incorporated into a parallel data-clustering tool, P-CLUSTER, designed to execute on a network of workstations. Experiments involving the unsupervised segmentation of standard texture images were performed. For some data sets, a 96 percent reduction in co...
متن کاملA Design Framework for Ultra-Large-Scale Autonomic Systems
The origins of ultra-large-scale (ULS) systems derive from social problems that are getting more and more complex, such as climatic monitoring, transportation, citizens protection and security. These factors imply a continuous increase of information systems that evolve towards ultra-dimension systems, requiring digital communication networks that allow for communication between people, between...
متن کاملA Parallel Algorithm for Large-scale Multiple Sequence Alignment
Multiple sequence alignment is a central topic of extensive research in computational biology. Basically, two or more protein sequences are compared to evaluate their similarity and to identify conserved regions. This work reports a methodology for parallel processing of a multiple sequence alignment algorithm (ClustalW) in an environment of networked computers. A detailed description of the mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bioinformatics
سال: 2018
ISSN: 1367-4803,1460-2059
DOI: 10.1093/bioinformatics/bty617